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(57) Abstract 

Model systems of Alzheimer's disease comprise a DNA sequence encoding an amy- 
loid precursor protein (APP) isoform or fragment that has an amino acid substitution. The 
substituted amino acid may be other than valine at the amino acid position corresponding 
to amino acid residue position 717 of APP770. Methods of determining genetic predisposi- 
tion to Alzheimer's disease are also disclosed. 
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TF.ST AND MODFT- FOR ALZH FTMFR'S DISEASE 

O 

BACKGROUND OF THE INVENTION 
Alzheimer's disease is a progressive disease known generally as 
5 senile dementia. Broadly speaking the disease falls into two categories, namely 
late onset and early onset. Late onset, which occurs in old age (65 + years), may 
be caused by the natural atrophy of the brain occurring at a faster rate and to a 
more severe degree than normal. Early onset Alzheimer's disease is much more 
infrequent but shows a pathologically identical dementia with diffuse brain atrophy 
1 o . which develops well before the senile period, Le,., between the ages of 35 and 60 
years. There is evidence that one form of this type of Alzheimer's disease shows 
a tendency to run in families and is therefore known as familial Alzheimer's 
disease (FAD). 

In both types of Alzheimer's disease the pathology is the same but 
15 the abnormalities tend to be more severe and more widespread in cases beginning 
at an earlier age. The disease is characterized by two types of lesions in the brain, 
these are senile plaques and neurofibrillary tangles. 

Senile plaques are areas of disorganized neuropil up to 150/xm 
across with extracellular amyloid deposits at the center. Neurofibrillary tangles 
20 are intracellular deposits of amyloid protein consisting of two filaments twisted 
about each other in pairs. 

The major protein subunit, 0-amyloid protein, of the amyloid 
filaments of the senile plaque is a highly aggregating small polypeptide of 
approximate relative molecular mass 4,500. This protein is a cleavage product of 
25 a much larger precursor protein called amyloid precursor protein (APP). 

At present there is no known effective therapy for the various forms 
of Alzheimer's disease (AD). However, there are several other forms of dementia 
for which treatment is available- and which give rise to progressive intellectual 
deterioration closely resembling the dementia associated with Alzheimer's disease. 
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A diagnostic test for AD would therefore provide a valuable tool in the diagnosis 
and treatment of these other conditions, by way of being able to exclude 
Alzheimer's disease. It will also be of value when a suitable therapy does become 
available. 

5 Also important is the development of experimental models of 

Alzheimer's disease that can be used to define further the underlying biochemical 
events involved in AD pathogenesis. Such models could presumably be employed, 
in one application, to screen for agents that alter the degenerative course of 
Alzheimer's disease. For example, a model system of Alzheimer's disease could 
10 be used to screen for environmental factors that induce or accelerate the 
pathogenesis of AD. In contradistinction, an experimental model could be used 
to screen for agents that inhibit, prevent, or reverse the progression of AD. 
Presumably, such models could be employed to develop pharmaceuticals that are 
effective in preventing, arresting, or reversing AD. 

15 

SUMMARY OF THE INVENTION 
The present invention provides model systems of Alzheimer's 
disease, wherein the model system comprises a DNA sequence encoding an 
amyloid precursor protein (APP) ispform or fragment that has an amino acid other 
2 0 than valine at the amino acid position corresponding to amino acid residue position 
717 of APP770. 

In a first embodiment, the present invention provides an isolated 
DNA sequence that encodes an amyloid precursor protein (APP) isoform or 
fragment that has an amino acid other than valine at the amino acid position 
25 corresponding to amino acid residue position 717 of APP770. 

In a second embodiment, the present invention provides a transgenic 
nonhuman animal that harbors at least one integrated copy of a human DNA 
sequence that encodes an amyloid precursor protein (APP) isoform or fragment 
that has an amino acid other than valine at the amino acid position corresponding 
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to amino acid residue position 717 of APP770. 

In a third embodiment, the present invention provides a transgenic 
nonhuman animal wherein at least one of the endogenous nonhuman APP alleles 
has been completely or partially replaced by all or a portion of a human APP gene 
5 that includes a codon 717 that does not encode valine. 

In a fourth embodiment, the present invention provides cells, 
typically mammalian cells and preferably mammalian cells of the neural, glial, or 
astrocytic lineage, that have been transformed or transfected with a heterologous 
DNA sequence, or have been derived from a transgenic nonhuman animal, wherein 

10 the cells express an amyloid precursor protein (APP) isoform or fragment that has 
an amino acid other than valine at the amino acid position corresponding to amino 
acid residue position 717 of APP770. In accordance with standard protocols, 
cultured human cells, either primary cultures or immortalized cell lines, may be 
transfected, either transiently or stably, with a mutant APP allele so that the 

15 cultured human cell expresses a mutant APP polypeptide. 

In a fifth embodiment, the present invention provides a method of 
producing transgenic nonhuman animals and transformed cells that contain a DNA 
sequence encoding an amyloid precursor protein (APP) isoform or fragment that 
has an amino acid other than valine at the amino acid position corresponding to 

20 amino acid residue position 717 of APP770. 

In a sixth embodiment, the present invention provides a method of 
producing, free from other human proteins, a human amyloid precursor protein 
(APP) isoform or fragment that has an amino acid other than valine at the amino 
acid position corresponding to amino acid residue position 717 of APP770. 

25 In a seventh embodiment, the present invention provides a human 

amyloid precursor protein (APP) isoform or fragment, free from other human 
proteins, that has an amino acid other than valine at the amino acid position 
corresponding to amino acid residue position 717 of APP770. 
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In an eighth embodiment, the invention provides a method for 
detecting an APP allele that is linked (i.e., cosegregates with) a genetic 
predisposition to Alzheimer's disease, particularly early onset AD, wherein such 
a pathognomonic APP allele is detected by determining that codon 717 of the allele 
does not encode valine. Preferably, a pathognomonic APP allele is detected when 
codon 717 is determined to encode either isoleucine, glycine, or phenylalanine. 
Thus, methods for locating the presence of genetic alterations associated with 
Alzheimer's disease are provided. This diagnostic method may be used to predict 
the development of the disease prior to onset, for genetic screening, or to detect 
a specific mutation in an experimental nonhuman animal or a cell. 

In a ninth embodiment, the invention provides a human variant APP 
polypeptide free of other human proteins, typically present in a cell of a nonhuman 
animal. The invention also relates to an isolated nucleic acid encoding such a 
polypeptide and to uses and applications of such nucleic acid as are described 
above in relation to the specific embodiment of the invention which involves an 
amino acid substitution at position 717 (as defined in relation to APP770). 

According to one aspect of the invention there is provided a method 
for detecting the presence, in a nucleic acid or other sample removed from a 
subject, of the gene for Alzheimer's disease comprising identifying a genetic 
alteration in a gene sequence coding for APP. Such genetic alterations may 
include mutations, insertions or deletions. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 illustrates a first pedigree in which early onset AD is 
apparently inherited as an autosomal dominant disorder. The average age of onset 
in this family is 57±5 years. Black symbols denote affected individuals and 
oblique lines indicate individuals who are deceased. Females are denoted by 
circles and males by squares. Triangles are used in the present generation to 
preserve anonymity. In generation II the spouses of the two affected brothers were 
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sisters. Samples were available from the 13 individuals whose haplotypes are 
illustrated, from a further 19 children and spouses of these individuals and from 
7 more distantly related unaffected individuals. Beneath the pedigree are 
ideograms of the two chromosomes 21 in each individual of the third generation 
5 at four loci on the long arm of the chromosome. The linkage data suggest that the 
black chromosomes were inherited from the affected fathers. 

Fig. 2 shows an autoradiograph of a sequencing gel from part of 
exon 17 of the APP gene in a normal and an affected individual from the Fig. 1 

o 

pedigree showing a single base pair change at base pair 2149 in the affected 
10 individual. This C to T transition leads to an amino acid substitution of a valine 
by an isoleucine at codon 717. 

Fig. 3 shows part of the amino acid sequence encoded by exons 16 
and 17 of the APP gene showing the mutation valine to isoleucine (V to I) within 
the transmembrane domain and the mutation causing HCHWA-D (E to Q) in the 
15 extracellular domain. The shaded region of the transmembrane domain and the 
boxed amino acids of the extracellular domain represent the sequence of the 
deposited jS-amyloid peptide. Adapted from Kang et al (1987) Nature 325:733. 

Fig. 4 shows Bell digests of the exon 17 PCR product from 
unaffected and affected individuals in an early onset AD family showing 
20 co-segregation of the restriction site and the disease. 

Fig. 5 shows the pedigree of family F19, together with D21S210 

data. 

Fig. 6 shows APP exon 17 sequences in an affected and unaffected 
member of F19. In the affected member there is a G- >T transition at position 2150. 
25 Fig. 7 shows the sequence of APP695. 

Fig. 8 shows the sequence of APP751. 
Fig. 9 shows the sequence of APP770. 



WO 92/13069 



PCT/GB92/00123 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The accumulation of ^-amyloid protein (A4) in particular brain 
regions is one of the main pathologic characteristics of Alzheimer's disease. The 
/J-amyloid protein is an approximately 4 kD protein (39 to 42 amino acids) which 
5 is derived, as an internal cleavage product, from one or more isoforms of a larger 
amyloid precursor protein (APP). There are at least five distinct isoforms of APP 
containing 563, 695, 714, 751, and 770 amino acids, respectively (Wirak et al. 
(1991) Science 253:323). These isoforms of APP are generated by alternative 
splicing of primary transcripts of a single gene, designated the APP gene, which 

10 is located on human chromosome 21. It is known that most of the APP isoforms 
are glycosylated transmembrane proteins (Goldgaber et al. (1987) Science 
235:877), and that four of the isoforms, AA563, APP714, APP751 and APP770, 
have a protease inhibitor domain that is homologous to the Kunitz type of serine 
protease inhibitors. The /J-amyloid (A4) segment comprises approximately half of 

15 the transmembrane domain and approximately the first 28 amino acids of the 
extracellular domain of an APP isoform. 

Proteolytic processing of APP in vivo is a normal physiological 
process. Carboxy-terminal truncated forms of APP695, APP751, and APP770 are 
present in brain and cerebrospinal fluid (Palmert et al. (1989) Proc. Natl. Acad. 

20 Sci. U.S.A. 86:6338; Weidemann et al. (1989) Cell 57:115) and result from 
cleavage of the APP isoform at a constitutive cleavage site within the A4 peptide 
domain of an APP isoform (Esch et al. (1990) Science 248:1122). Normal 
proteolytic cleavage at the constitutive cleavage site yields a large (approximately 
100 kD) soluble, N-terminal fragment that contains the protease inhibitor domain 

25 in some isoforms, and a 9-10 kD membrane-bound, C-terminal fragment that 
includes most of the A4 domain. 

Generation of pathogenic /J-amyloid (A4) protein appears to be the 
result of aberrant or alternative proteolytic processing of APP, such that normal 
cleavage at the constitutive site within the A4 domain does not occur, but rather 
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cleavage occurs at two specific sites which flank the A4 domain. One of these 
aberrant cleavage sites is in the transmembrane domain and the other aberrant 
cleavage site is located approximately at the N-terminus of the first 28 amino acids 
of the extracellular domain (see Fig, 3). Such aberrant proteolytic cleavage 
5 produces the 0-amyloid A4 polypeptide which is prone to forming dense 
amyloidogenic aggregates that are resistant to proteolytic degradation and removal. 
The resultant 0-amyloid aggregates presumably are involved in the formation of 
the abundant amyloid plaques and cerebrovascular amyloid that are the 
neuropathologies hallmarks of Alzheimer's disease. However, the exact aberrant 
1 0 cleavage sites are not always precise; p-amyioid molecules isolated from the brain 
of a patient with AD show N- and C- terminal heterogeneity. Therefore, the 
aberrant cleavage pathway may involve either sequence-specific proteolysis 
followed by exopeptidase activity (creating end-heterogeneity), or may not be 
sequence-specific. 

15 The APP gene is known to be located on human chromosome 21. 

A locus segregating with familial Alzheimer's disease has been mapped to 
chromosome 21 (Hyslop et al. (1987) Science 235:885) close to the APP gene. 
Recombinants between the APP gene and the AD locus have been previously 
reported (Schellenberg et al. (1988) Science 241:1507). The data appeared to 

20 exclude the APP gene as the site of any mutation that might cause FAD (Van 
Broekhoven et al. (1987) Nature 329:153; Tanzi et al. (1987) Nature 329:156). 

Recombinant DNA technology provides several techniques for 
analyzing genes to locate possible mutations. For example, the polymerase chain 
reaction (Bell (1989) Immunology Today, 10:351) may be used to amplify specific 

25 sequences using intronic primers, which can then be analyzed by direct 
sequencing. 

* 

Researchers working in the area of the hereditary cerebral 
haemorrhage with amyloidosis of the Dutch type ("HCHWA-D") (Levy et al 
(1990) Science 248:11224) found a substitution of Glu to Gin at residue 618 (using 
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the APP695 numbering system) in APP which is thought to result in the deposition 
of /3-amyloid in the cerebral vessels of these patients. The present inventors have 
identified a single base substitution, aCtoT transition at base pair 2149, has been 
found in part of the sequence of the APP gene in some cases of familial 
5 Alzheimer's disease. This base pair transition leads to an amino acid substitution, 
i.e., valine to isoleucine at amino acid 717 (APP^q) (see Yoshikai et al. (1990) 
Gene 87:257), close to the C-terminus of the /3-amyloid protein. This suggests that 
some cases of Alzheimer's disease are caused by mutations in the APP gene, and 
specifically mutations that change codon 717 such that it encodes an amino acid 

10 other than valine. 

Additionally, a further single base substitution, a T to G transition 
at adjacent base pair 2150, has been found in part of the sequence of the APP gene 
in other cases of familial Alzheimer's disease. This base pair transition leads to 
a different amino acid substitution, namely valine to glycine, at amino acid 717, 

15 thereby strengthening the argument that some cases of Alzheimer's disease are 
caused by mutations in the APP gene, specifically at codon 717. 

It is now clear that a mutation in the APP gene locus that results in 
a substitution of isoleucine for valine.at codon 717 (residue 642 in APP695) gives 
rise to AD in some families (Goate et al. (1991) Nature 342:704). A second APP 

20 allelic variant wherein glycine is substituted for valine at codon 717 is now 
identified, and is so closely linked to the AD phenotype as to indicate that allelic 
variants at codon 717 of the APP gene, particularly those encoding an amino acid 
other than valine, and more particularly those encoding a isoleucine, glycine, or 
phenylalanine, are pathogenic and/or pathognomonic alleles (Chartier-Harlin et al. 

25 (1991) Nature 251:844). 

Proteolysis on either side of the /3-amyloid (A4) region of APP may 
be enhanced or qualitatively altered by the specific mutations at codon 717, 
increasing the rate of /3-amyloid deposition and aggregation. Such codon 717 
mutations may increase /3-amyloid formation by providing a poorer substrate for 
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the main proteolytic pathway (cleavage at the constitutive site) or a better substrate 
for a competing, alternative cleavage pathway (at aberrant cleavage sites). 

DFFINITIONS 

5 A number of terms and expressions are used throughout the 

specification and, to facilitate the understanding thereof, the following definitions 
are provided: 

As used herein, "exon" refers to any segment of an interrupted gene 

that is represented in the mature RNA product. 
10 As used herein, ts intron :: refers to a segment of an interrupted gene 

that is not represented in the mature RNA product. Introns are part of the primary 

nuclear transcript but are spliced out to produce mRNA, which is then transported 

to the cytoplasm. 

As used herein, the phrase "gene sequence coding for amyloid 
15 protein precursor" may be interpreted to mean the -DNA and cDNA sequence as 

detailed by Yoshikai et al (1990) Gene £7:257 and Kang et al, loc. cit. . together 

with the promoter DNA sequence as described by Salbaum et al. (1988) EMBO 

7(21:2807. 

As used herein, the terms "label" or "labeled" refers to 
20 incorporation of a detectable marker (e.g., by incorporation of a radiolabeled 
nucleotide or by end-labeling with a terminal radiolabeled phosphate). DNA or 
RNA is typically labeled by incorporation of a radiolabeled nucleotide (H 3 , C 14 , 
S 35 , P 32 ) or a biotinylated nucleotide that can be detected by marked avidin (e.g., 
avidin containing a fluorescent marker or enzymatic activity) or digoxygeninylated 
25 nucleotide that can be detected by marked specific antibody. 

As used herein, "isoform", "APP", and "APP isoform" refer to a 
polypeptide that is encoded by at least one exon of the APP gene (Kitaguchi et al. 
(1988) Nature 221:530; Ponte et al., ikiiL, p.525; R.E. Tanzi, ibid,, p.528; de 
Sauvage and Octave (1989) Science 245:651; Golde et al. (1990) Neuron 4:253). 
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An APP isoform may be encoded by an APP allele (or exon thereof) thai is 
associated with a form of Alzheimer's disease or that is not associated with an AD 

disease phenotype. 

The terra °0-amyloid gene" is used herein as a synonym for the APP 
5 gene, as /3-amyloid is a protein product produced by a post-translational cleavage 

of an APP gene product 

As used herein, fragment" refers to a polypeptide of at least about 
9 amino acids, typically 50 to 75, or more, wherein the polypeptide contains an 
amino acid core sequence (listed in order from amino- to carboxy-terminal 

10 direction): 

.fle-Ala-Thr-Val-He-X-Ile-Thr-Leu- [SEQ ID NO:6] 
where X is any of the twenty conventional amino acids except valine, and 
particularly where X is isoleucine, glycine, or phenylalanine. A fragment may be 
a truncated APP isoform, modified APP isoform (as by amino acid substitutions, 

15 deletions, or additions outside of the core sequence), or other variant polypeptide 
sequence, but is not a naturally-occurring APP isoform or /3-amyloid polypeptide 
that is present in a human individual, whether affected by AD or not. If desired, 
the fragment may be fused at either terminus to additional amino acids, which may 
number from 1 to 20, typically 50 to 100, but up to 250 to 500 or more. 

20 As used herein, W APP751 H and "APP770" refer, respectively, to the 

751 and 770 amino acid residue long polypeptides encoded by the human APP 
gene (Ponte et al. loc. cit .: Kitaguchi et al. loc. cit. : Tanzi et al. to, cit,). 

As used herein, "codon 717* refers to the codon (Le., the 
trinucleotide sequence) that encodes the 717th amino acid position in APP770, or 
^ 25 the amino acid position in an APP isoform or fragment that corresponds to the 
717th position in APP770. For example but not limitation, a 670 residue long 
fragment that is produced by truncating APP770 by removing the 100 N-terminal 
amino acids has its 617th amino acid position corresponding to codon 717. In fact, 
as used herein, codon 717 refers to the codon that encodes the 698th amino acid 
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residue of APP751 [SEQ ID NO:2] and the 642nd amino acid residue of APP695 
[SEQIDNO:!]. 

As used herein, "human APP isoform or fragment" refers to an APP 
isoform or fragment that contains a sequence of at least 9 consecutive amino acids 
5 that is identical to a sequence in a human APP770, APP751, or APP695 protein 
that occurs naturally in a human individual, and wherein an identical sequence is 
not present in an APP protein that occurs naturally in a nonhuman species. 

A nucleic acid is "operably linked" when it is placed into a 
functional relationship with another nucleic acid sequence/ For instance, a 
10 promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the sequence. With respect to transcription regulatory sequences, 
operably linked means that the DNA sequences being linked are contiguous and, 
where necessary to join two protein coding regions, contiguous and in reading 
frame. 

15 The term "corresponds to" is used herein to mean that a sequence 

is homologous (i.e. f is identical, not strictly evolutionarily related) to all or a 
portion of a reference sequence. In contradistinction, the term "complementary 
to" is used herein to mean that the complementary sequence is homologous to all 
or a portion of a reference sequence. For illustration, the nucleotide sequence 

20 "TATAC" corresponds to a reference sequence "TATAC" and is complementary 
to a reference sequence "GTATA". 

The term "transcriptional enhancement" is used herein to refer to 
functional property of producing an increase in the rate of transcription of linked 
sequences that contain a functional promoter. 

25 The term "agent" is used herein to denote a chemical compound, a 

mixture of chemical compounds, a biological macromolecule, or an extract made 
from biological materials such as bacteria, plants, fungi, or animal (particularly 
mammalian) cells or tissues. Agents are evaluated for potential biological activity 
by inclusion in screening assays described hereinbelow. 
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As used herein, the term "mutant" refers to APP alleles having 
missense mutations that are pathognomonic for a genetic predisposition for 
developing AD; specifically a mutation at codon 717 (as referenced by the amino 
acid sequence in APP770) of the APP gene, such that codon 717 encodes one of 
5 the nineteen amino acids that are not valine <x.e., glycine, methionine, alanine, 
serine, isoleucine, leucine, threonine, proline, histidine, cysteine, tyrosine, 
phenylalanine, glutamic acid, tryptophan, arginine, aspartic acid, asparagine, 
lysine, and glutamine), but preferably isoleucine, glycine, or phenylalanine. Thus 
a mutant APP770 polypeptide is an APP770 polypeptide that has an amino acid 

10 residue at position 717 that is not valine. Other mutant APP isoforms comprise 
a non-valine amino acid at the amino acid residue position that corresponds to 
codon 717 <Le., that is encoded by codon 717). Similarly, a mutant APP allele 
or a variant APP codon 717 allele is an APP allele that encodes an amino acid 
other than valine at codon 717 (referenced to the human APP770 deduced 

15 translation as described in the "codon 717" definition, supra), preferably 
isoleucine, glycine, or phenylalanine. Hence, an APP allele that encodes valine 
at codon 717 is a "wild-type" APP allele. 

It is apparent to one of skill in the art that nucleotide substitutions, 
deletions, and additions may be incorporated into the polynucleotides of the 

20 invention. However, such nucleotide substitutions, deletions, and additions should 
not substantially disrupt the ability of the polynucleotide to hybridize to one of the 
polynucleotide sequences shown in Figs. 5 and 6 under hybridization conditions 
that are sufficiently stringent to result in specific hybridization. 

"Specific hybridization" is defined herein as the formation of hybrids 

25 between a probe polynucleotide (e.g., a polynucleotide of the invention which may 
include substitutions, deletion, and/or additions) and a specific target 
polynucleotide (e.g., a polynucleotide having the sequence ), wherein the probe 
preferentially hybridizes to the specific target such that, for example, a band 
corresponding to a variant APP allele or restriction fragment thereof, can be 
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identified on a Southern blot, whereas a corresponding wild-type APP allele (Lfi» 
one that encodes valine at codon 717) is not identified or can be discriminated 
from a variant APP allele on the basis of signal intensity. Hybridization probes 
capable of specific hybridization to detect a single-base mismatch may be designed 
5 according to methods known in the art and described in Maniatis et al. t Molecular 
rinninp; A laboratory Manual . 2nd Ed., (1989), Cold Spring Harbor, N.Y. and 
Bergcr and Kimmel, Methods in En rvmologv. Volume 152. Guide to Molecular 
Pinning Techniques (1987), Academic Press, Inc., San Diego, CA; Gibbs et al. 
(1989) Nucleic Acids Res. 17:2437; Kwok et al. (1990) Nwteic Acids Pes, 

10 ia-999; Miyada et al, (1987) Methods Enzvmol. 154:94, each of which is 
incorporated herein by reference. The T m for oligonucleotides is calculated under 
standard conditions (1 M NaCl) to be [4°C x (G+C) + 2°C x (A+T)]. While 
the conditions of PCR differ from the standard conditions, this T m is used as a 
guide for the expected relative stabilities of oligonucleotides. Allele-specific 

15 primers are typically 13-15 nucleotides long, sometimes 16-21 nucleotides long, 
or longer; when short primers are used, such as a 14 nucleotide long primer, low 
annealing temperatures are used, typically 44 to 50°C, occasionally somewhat 
higher or lower depending on the base composition of the sequenced). 

2 o DESCRIPTION OF THE PREFERRED EMBODIMENTS 

potion of Mutant Codon 717 APP Alleles 

In an embodiment of the invention, the method involves identifying 
a genetic alteration at amino acid 717, which may cause the consensus Val to be 
changed, for example, to another hydrophobic residue. This will generally be 

2 5 performed on a specimen removed from the subject. Hydrophobic residues include 
Leu, Ala, De and Gly, the first three of which have aliphatic side chains. Phe also 
has a hydrophobic residue which may be appropriate. As indicated above, 
preferred residues include lie, Gly, and Phe (Murrell et al, (1991) Science 
254:97). 
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The feet that these mutations discussed above arc at the same codon 
may be a coincidence, but this seems unlikely on statistical grounds. There are 
two possibilities that may explain these data. First, substitution of the valine 
residue at codon 717 may result in increased beta-amyloid deposition due to 
changes in APP metabolism. Secondly, the variation in the sequence around this 
position may result in increased translation of APP mRNAs and thus cause AD by 
a route analogous to that by which AD is believed to be caused in Down Syndrome 
(Tanzi and Hyman (1991) Nature 350:564 and Rumble et al. (1989) N. Engl. J. 
Med. 320: 1446). In situ hybridization studies have shown that APP 717 mutations 
do not alter APP expression (Harrison et al. (1991) Neurorep. 2:152). 

The V717I (APP 717 Val-> He), V717G (APP 717 
Val->Gly) and V717F (APP 717 Val->Phe) mutations would destabilise a 
putative stem loop structure and destroy a possible iron-responsive element 
between base pairs 2131 and 2156 (Tanzi and Hyman, loc. at.). There are several 
other possible mutations which could also disrupt this structure, many of which 
would be silent at the protein level; yet these mutations specifically referred to 
have involved a change to the same amino acid, and no sUent changes or changes 
to other amino acids have been reported prior to the work described herein. 
Examination of sequence data from 10 other mammalian species (Johnstone et al. 
(1991) Mol Brain Res. 10:299) shows that while the valine residue at codon 717 
is conserved in all of them, the putative stem loop structure postulated from the 
human sequence (Tanzi and Hyman loc. cit.) would not be predicted to occur in 
either cattle or sheep; and in pig and mouse the consensus sequence for the 
iron-responsive elements is not present. Finally, such stem loop structures are 
believed to modulate gene translation by altering mRNA stability (Klausner and 
Harford (1989) Science 246:870); however, Harrison and colleagues (Harrison et 
al. loc. cit.) have shown by in situ hybridization that APP mRNAs are not grossly 
altered in the brain of an individual with the V717I mutation. For these reasons, 
it is believed likely that alterations in the rate of APP translation caused by the 
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specific mutations identified are not likely to be the key to their pathogenicity. 

The fact that the specific mutations discussed involve different 
changes (Val->Iie, Val->Gly, and Val->Phe) suggests that neither side-chain 
hydrophobicity nor side-chain bulk is the crucial issue. All examples of APP 
5 alleles that encode an amino acid other than valine at codon 717, cosegregate with 
FAD; suggesting that the valine that occurs at position 717 in wild-type APP770 
or APP751 is a critical amino acid residue for non-pathogenic APP proteolytic 
processing (i.e., by the constitutive cleavage pathway). 

The major metabolic pathway for the APP 'molecule involves 

.10 cleavage within the beta-amyloid fragment (Esch et al, Joe, .dr.). To .generate 
beta-amyloid, there must be a second pathway in which APP is cleaved outside this 
sequence. Such a cleavage would be likely to leave a stub of the APP molecule 
containing the beta-amyloid fragment embedded in the membrane. Possibly, the 
beta-amyloid-containing fragment which is generated by the second pathway is 

15 degraded by peptidase action; the reported mutations may be pathogenic because 
peptides which contain them may be more resistant to the actions of this peptidase. 
Therefore, genetic alterations in the APP gene which result in altered (generally 
reduced) degradative properties are particularly important in the application of the 
invention. There are several methodologies available from recombinant DNA 

20 technology which may be used for detecting and identifying a genetic mutation 

responsible for Alzheimer's disease. These include direct probing, polymerase 

* 

chain reaction (PCR) methodology, restriction fragment length polymorphism 
(RFLP) analysis and single strand conformational analysis (SSCA). 

Detection of point mutations using direct probing involves the use 
2 5 oligonucleotide probes which may be prepared synthetically or by nick translation . 

The DNA probes may be suitably labelled using, for example, a radiolabel, 
enzyme label, fluorescent label, biotin-avidin label and the like for subsequent 
visualization in for example a Southern blot hybridization procedure. The labelled 
probe is reacted with the sample DNA bound to a nitrocellulose or Nylon 66 
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substrate. The areas that cany DNA sequences complementary to the labelled 
DNA probe become labelled themselves as a consequence of the reannealling 
reaction. The areas of the filter that exhibit such labelling may then be visualized, 
for example, by autoradiography. 
5 Alternative probing techniques, such as ligase chain reaction (LCR) 

involve the use of mismatch probes, Lfi., probes which have full complementarity 
with the target except at the point of the mutation. The target sequence is then 
allowed to hybridize both with oligonucleotides having full complementarity and 
oligonucleotides containing a mismatch, under conditions which will distinguish 

10 between the two. By manipulating the reaction conditions it is possible to obtain 
hybridization only where there is fall complementarity. If a mismatch is present 
then there is significantly reduced hybridization. 

The polymerase chain reaction (PCR) is a technique that amplifies 
specific DNA sequences with remarkable efficiency. Repeated cycles of 

15 denaturation, primer annealing and extension carried out with a heat stable enzyme 
Taq polymerase leads to exponential increases in the concentration of desired DNA 
sequences. 

Given a knowledge of the nucleotide sequence encoding the 
precursor of amyloid protein of AD (Kang et al. he. cit., and Yoshikai, above) 

20 it may be possible to prepare synthetic oligonucleotides complementary to 
sequences which flank the DNA of interest. Each oligonucleotide is 
complementary to one of the two strands. The DNA is then denatured at high 
temperatures (e.g., 95°Q and then reannealed in the presence of a large molar 
excess of oligonucleotides. The oligonucleotides, oriented with their 3' ends 

25 pointing towards each other, hybridize to opposite strands of the target sequence 
and prime enzymatic extension along the nucleic acid template in the presence of 
the four deoxyribonucleotide triphosphates. The end product is then denatured 
again for another cycle. After this three-step cycle has been repeated several times, 
amplification of a DNA segment by more than one million fold can be achieved. 
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The resulting DNA may then be directly sequenced in rder to locate any genetic 
alteration. Alternatively, it may be possible to prepare oligonucleotides that will 
only bind to altered DNA, so that PCR will only result in multiplication of the 
DNA if the mutation is present. Following PCR, allele-specific oligonucleotide 
5 hybridization (Dihella et al. (1988) Lancet 1:497) may be used to detect the AD 
point mutation. Alternatively an adaptation of PCR called amplification of specific 
alleles (PASA) can be employed; this uses differential amplification for rapid and 
reliable distinction between alleles that differ at a single base pair. 

In yet another method PCR may be followed by restriction 
10 endonuclease digestion with subsequent analysis of the resultant products. The 
substitution of T for C at base pair 2149, found as a result of sequencing exon 17, 
creates a Bell restriction site. The creation of this restriction endonuclease 
recognition site facilitates the detection of the AD mutation using RFLP analysis 
or by detection of the presence or absence of a polymorphic Bell site in a PCR 
C^) 15 product that spans codon 717. 

For RFLP analysis, DNA is obtained, for example, from the blood 
of the subject suspected of having AD and from a normal subject is digested with 
the restriction endonuclease Bell and subsequently separated on the basis of size 
using agarose gel electrophoresis. The Southern blot technique can then be used 
20 to detects by hybridization with labeled probes, the products of endonuclease 
digestion. The patterns obtained from the Southern blot can then be compared. 
Using such an approach, DNA spanning an Alzheimer's mutation that creates or 
removes a restriction site at codon 717, such as the Bell site, is detected by 
determining the number of bands detected and compa^ng this number to a 
25 reference allele that has a codon 717 allele that encodes valine. 

Correspondingly, the substitution of G for T at base pair 2150 
creates a SfaNl restriction site (GCATC), which may be exploited in a manner 
similar to that described above, mutatis mutandis. Similar creation of additional 
restriction sites by nucleotide substitutions within codon 717, wherein the codon 
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717 encodes an amino acid other than valine, can be readily calculated by 
reference to the genetic code and a list of nucleotide sequences recognized by 
restriction endonucleases fPromega Protor-nls and Applications Guide. (1991) 
Promega Corporation, Madison, Wisconsin). 
5 Single strand conformational analysis (SSCA) (Orita et aL (1989) 

Genomics 5:874 and Orita et aL (1990) Genomics 6:271) offers a relatively quick 
method of detecting sequence changes which may be appropriate in at least some 
instances. 

PCR amplification of specific alleles (PASA) is a rapid method of 
10 detecting single-base mutations or polymorphisms (isewton et aL ii*o*j flffiaBS 
Acids Res. 17:2503; Nichols et al. (1989) Genomics 5:535; Okayama et al. (1989) 
T,T ? h. nin.Med. 114:105: Sarkar etal. (1990) Anal, Bipchem, 186:64; Sommer 
etal. no«o^ Mavn CHin. Proc. 64:1361: Wu (1989)Proff, N?Tl. Acad, SsL U.S.A. 
S&2757; and Dutton et al. (1991) Biotechnioues 11:700). PASA (also known as 
v y 15 glide specific amplification) involves amplification with two oligonucleotide 

primers such that one is aUele-specific. The desired allele is efficiently amplified, 
while the other allele(s) is poorly amplified because it mismatches with a base at 
or near the 3* end of the aUele-specific primer. Thus, PASA or the related method 
of PAMSA may be used to specifically amplify one or more variant APP codon 
20 717 alleles. Where such amplification is done on genetic material (or RNA) 
obtained from an individual, it can serve as a method of detecting the presence of 
one or more variant APP codon 717 alleles in an individual. 

Similarly, a method known a ligase chain reaction (LCR) has been 
used to successfully detect a single-base substitution in a hemoglobin allele that 
25 causes sickle cell anemia (Barany et al. (1991) Proc, Natl, Acad, Set. U.S.A. 
81:189; Weiss (1991) Science 254:1292). LCR probes may be combined, or 
multiplexed for simultaneously screening for multiple different mutations. Thus, 
one method of screening for variant APP codon 717 alleles is to multiplex at least 
two, and preferably all, LCR probes that will detect an APP allele having a codon 
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717 that does n t encode valine, but that does encode an amino acid. The universal 
genetic code provides the degenerate sequences of all the encoded non-valine 
amino acids, thus LCR probe design for detecting any particular variant codon 717 
allele is straightforward, and multiplexed pools of such LCR probes may be 
5 selected in the discretion of a practitioner for his particular desired use. 

In performing diagnosis using any of the above techniques or 
variations thereof, it is preferable that several individuals are examined. These 
may include an unaffected parent, an affected parent, an affected sibling, an 
unaffected sibling as well as other perhaps more distant family members. 

10 

frfrvtel Animals and Cell Lines 

Having identified specific mutations in codon 717 of the 0-amyloid 

gene as a cause of familial Alzheimer's disease (FAD), it is possible, using genetic 

manipulation, to develop transgenic model systems and/or whole cell systems 
15 containing the mutated FAD gene (or a portion thereof) for use, for example, as 

model systems for screening for drugs and evaluating drug effectiveness. 

Additionally, such model systems provide a tool for defining the underlying 

biochemistry of APP and 0-amyloid metabolism, which thereby provides a basis 

for rational drug design. 
20 One type of cell system can be naturally derived. For this, blood 

samples from the affected subject must be obtained in order to provide the 

necessary cells which can be permanently transformed into a lymphoblastoid cell 

line using, for example, Epstein-Barr virus. 

Once established, such cell lines can be grown continuously in 
25 suspension culture and may be used for a variety of in vitro experiments to study 

APP expression and processing. 

Since the FAD mutation is dominant, an alternative method for 

constructing a cell line is to engineer genetically a mutated gene, or a portion 

thereof spanning codon 717, into an established (either stably or transiently) cell 
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line of choice. Sisodia (1990) Science 248:492) has described the insertion of a 
normal APP gene, by transfection, into mammalian cells. Oltersdorf et al. ((1990) 
/. Biol. Cherru 265:4492) describe the insertion of APP into immortalized 
eukaryotic cell lines. 

5 Baculovirus expression systems are useful for high level expression 

of heterologous genes in eukaryotic cells. Knops et al. (1991) /. Biol. Chem. 
266(1 1):7285 describes the expression of APP using such a system. 

In yet a further use of the present method, it may be possible to 
excise the mutated gene (i.e., a variant APP codon 717 gene) for use in the 

10 creation of transgenic animals containing the mutated gene. For example, an 
entire human variant APP codon 717 allele may be cloned and isolated, either in 
parts or as a whole, in a cloning vector (e.g., XCharon35, cosmid, or yeast 
artificial chromosome). The human variant APP codon 717 gene, either in parts 
or in whole, may be transferred to a host nonhuman animal, such as a mouse. As 
\j) 15 a result of the transfer, the resultant transgenic nonhuman animal will preferably 

express one or more variant APP codon 717 polypeptides. Most preferably, a 
transgenic nonhuman animal of the invention will express one or more variant APP 
codon 717 polypeptides in a neuron-specific manner (Wirak et al. (1991) JMBQ 
1Q:289). This may be accomplished by transferring substantially the entire human 

20 ■ APP gene (encoding a codon 717 mutant) including the 4.5 Wlobase sequence that 
is adjacent to and upstream of the first major APP transcriptional start site. 

Alternatively, one may design minigenes encoding variant APP 
codon 717 polypeptides. Such minigenes may contain a CDNA sequence encoding 
a variant APP codon 717 polypeptide, preferably full-length, a combination of APP 

25 gene exons, or a combination thereof, linked to a downstream polyadenylation 
signal sequence and an upstream promoter (and preferably enhancer). Such a 
minigene construct will, when introduced into an appropriate transgenic host (e.g., 
mouse or rat), express an encoded variant APP codon 717 polypeptide, most 
preferably a variant APP codon 717 polypeptide that contains either an isoleucine, 

0 
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glycine, or phenylalanine residue at codon 717 of APP770 or the corresponding 
position in an APP isoform or fragment. 

One approach to creating transgenic animals is to target a mutation 
to the desired gene by homologous recombination in an embryonic stem (ES) cell 
5 line in vitro followed by microinjection of the modified ES cell line into a host 
blastocyst and subsequent incubation in a foster mother (see Frohman and Martin 
(1989) Cell 5£: 145). Alternatively, the technique of microinjection of the mutated 
gene, or a portion thereof, into a one-cell embryo followed by incubation in a 
foster mother can be used. Various uses of transgenic animals, particularly 

10 transgenic animals that express a wild-type APP isoform or fragment, are disclosed 
in Wirak et al (1991) EMBO, 10(2):289; Schilling et al. (1991) Gene 98(2):225; 
Quon et al (1991) Nature 252:239; Wirak et al (1991) Science 252:323; and 
Kawabata et al. (1991) Nature 251:476. Additional methods for producing 
transgenic animals are known in the art. 

15 Alternatively, site-directed mutagenesis and/or gene conversion can 

be used to mutate a murine (or other nonhuman) APP gene allele, either 
endogenous or transfected, such that the mutated allele does not encode valine at 
the codon position in the mouse APP gene that corresponds to codon 717 (of 
APP770) of the human APP gene (such position is readily identified by homology 

20 matching of the murine APP gene or APP protein to the human APP gene or 
APP770 protein). Preferably, such a mutated murine allele would encode 
isoleucine or glycine or phenylalanine at the corresponding codon position. 



Therapeutics 

25 Having detected the genetic mutation in the gene sequence coding 

for /3-amyloid protein in an individual not yet showing overt signs of familial AD, 
using the method of the present invention, it may be possible to employ gene 
therapy, in the form of gene implants, to prevent the development of the disease. 

Additional embodiments directed to modulation of the production of 
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variant APP proteins include methods that employ specific antisense 
polynucleotides complementary to all or part of a variant APP sequence, or for 
some embodiments a wild-type APP sequence. Such complementary antisense 
polynucleotides may include nucleotide substitutions, additions, deletions, or 
transpositions, so long as specific hybridization to the relevant target sequence, 
a variant APP codon 717 sequence, is retained as a property of the 
polynucleotide. Thus, an antisense polynucleotide must preferentially bind to a 
variant APP <i.e., codon 717 does not encode valine) sequence as compared to a 
wild-type APP (i.e., codon 717 does encode valine). It is evident that the 
antisense polynucleotide must reflect the exact nucleotide sequence of ihe variant 
allele (or wild-type allele where desired) and not a degenerate sequence. 

Complementary antisense polynucleotides include soluble antisense 
RNA or DNA oligonucleotides which can hybridize specifically to a variant APP 
MRNA species and prevent transcription of the mRNA species and/or translation 
of the encoded polypeptide (Ching et al. (1989) Pme. Natl. Acad, Sci. U.S.A. 
££:10006; Broder et aL (1990) Ann. Int. Med . 112:604; Loreau et al. (1990) 
FEBS Letters 271:53-56); Holcenberg et al. W091/11535; U.S. No. 7,530,165 
("New human CRIPTO gene" - publicly available through Derwent Publications 
Ltd., Rochdale House, 128 Theobalds Road, London, UK); WO91/09865; 
WO91/04753; WO90/13641; and EP 386563, each of which is incorporated herein 
by reference). The antisense polynucleotides therefore inhibit production of the 
variant APP polypeptides. Antisense polynucleotides may preferentially inhibit 
transcription and/or translation of mRNA corresponding to a variant (or wild-type) 
polypeptides can inhibit T lymphocyte activation. 

Antisense polynucleotides may be produced from a heterologous 
expression cassette in a transfectant cell or transgenic cell or animal, such as a 
transgenic neural, glial, or astrocytic cell, preferably where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et al. loc. 
at.). Alternatively, the antisense polynucleotides may comprise soluble 



WO 92/13069 



PCT/GB92/00123 



23 

oligonucleotides that are administered to the external milieu, either in the culture 
medium in vitro or in the circulatory system or interstitial fluid in vivo . Soluble 
antisense polynucleotides present in the external milieu have been shown to gain 
access to the cytoplasm and inhibit translation of specific mRNA species. In some 
5 embodiments the antisense polynucleotides comprise methylphosphonate moieties. 
For general methods relating to antisense polynucleotides, see AntiSfflSC RNA and 
DNA . (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY). 

* 

10 Mutant APP Antigens and Monoclonal Antibodies 

In yet another aspect of the invention, having detected a genetic 
alteration in a gene sequence coding for APP, it may be possible to obtain samples 
of the altered jS-amyloid protein from the same source. This protein may be 
derived from the brain tissue of a subject diagnosed as suffering from Alzheimer's 

15 disease, or more preferably are produced by recombinant DNA methods or are 
synthesized by direct chemical synthesis on a solid support. Such polypeptides will 
contain an amino acid sequence of an APP variant allele spanning codon 717. 
Examples of such sequences are: 

(a) -De-Ala-Thr-Val-ne-Gly-Ile-Thr-Leu- [SEQIDNO:7] 

20 (b) -ne-Ala-Thr-Val-ne-Met-ne-Thr-Leu- [SEQIDNO:8] 

(c) -ne-Ala-Thr-Val-Ile-Ala-Ile-Thr-Leu- [SEQIDNO:9] 

(d) -He-Ala-Thr-Val-Ile-Ser-Ile-Thr-Leu- [SEQIDNO:10] 

(e) -ne-Ala-Thr-Val-Ile-Ile-ne-Thr-Leu- [SEQ ID NO:ll] 

(f) -ne-Ala-Thr-Val-Ile-Leu-ne-Thr-Leu- [SEQ ID NO: 12] 
25 (g) -Ile-Ala-Thr-Val-Ile-Thr-Ile-Thr-LeuT [SEQ ID NO: 13] 

(h) -Ile-Ala-Thr-Val-Ile-Pro-Ile-Thr-Leu- [SEQ ID NO: 14] 
CO -Ile-Ala-Thr-Val-Ile-His-Ile-Thr-Leu- [SEQ ID NO: 15] 
0) -Ile-Ala-Thr-Val-Ile-Cys-Ile-Thr-Leu- [SEQ ID NO: 16] 
(k) -De-Ala-Thr-Val-Ile-Tyr-Ile-Thr-Leu- [SEQ ID NO: 17] 
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0) -ne-Ala-Thr-Val-ne-Phe-ne-Thr-Leu- [SEQIDNO:18) 
(ra) -ne-Ala-Thr-Val-ne-Glu-ne-Thr-Leu- [SEQ ID NO:19] 
(n) -ne-Ala-Thr-Val-ne-Trp-ne-Thr-Leu- [SEQIDNO:20] 
(o) -De-Ala-Thr-Val-ne-Arg-ne-Thr-Uu- [SEQroNO:21] 
(p) -He-Ala-Thr-Val-ne-Asp-ne-Thr-Leu- [SEQ ID NO:22] 
(q) -ile-Ala-Thr-Val-Ile-Asn-ne-Thr-Leu- [SEQIDNO:23] 
(r) -ne-Ala-Thr-Val-He-Lys-Ile-Thr-Leu- [SEQIDNO:24] 
(s) -De-Ala-Thr-Val-Ile-Gln-Ile-Thr-Leu- [SEQ ID NO:25] 

Using such polypeptide material it may then be possible to prepare 
antisera and monoclonal antibodies using, for example, the method of Kohler and 
Milstein ((1975) Nature 256:495). Such monoclonal antibodies could then form 

the basis of a diagnostic test- 
Such variant APP polypeptides may be used to immunize an animal 
15 for the production of specific antibodies. These, antibodies may comprise a 
polyclonal antiserum or may comprise a monoclonal antibody produced; by 
hybridoma cells. For general methods to prepare antibodies, see Antibodies; A 
j ? hn^tnrv Manual. (1988) E. Harlow and D. Lane, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, which is incorporated herein by reference. 

For example but not for limitation, a recombinantly produced 
fragment of a variant APP codon 717 polypeptide can be injected into a mouse 
along with an adjuvant so as to generate an immune response. Murine 
immunoglobulins which bind the recombinant fragment with a binding affinity of 
at least 1 x 10 7 M" 1 can be harvested from the immunized mouse as an antiserum, 
and may be further purified by affinity chromatography or other means. 
Additionally, spleen cells are harvested from the mouse and fused to myeloma cells 
to produce a bank of antibody-secreting hybridoma cells. The bank of hybridomas 
can be screened for clones that secrete immunoglobulins which bind the 
recombinantly produced fragment with an affinity of at least 1 x ltf M 1 . More 
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specifically, immunoglobulins that bind to the variant APP codon 717 polypeptide 
but have limited crossreactivity with a wild-type (i.e., codon 717 encodes valine) 
APP polypeptide are selected, either by preabsorption with wild-type APP or by 
screening of hybridoma cell lines for specific idiotypes that preferentially bind the 
5 variant as compared to the wild-type. 

The nucleic acid sequences of the present invention capable of 
ultimately expressing the desired variant APP polypeptides can be formed from a 
variety of different polynucleotides (genomic or cDNA, RNA, synthetic 
oligonucleotides, etc.) as well as by a variety of different techniques. 

i0 As stated previously, the DNA sequences will be expressed in hosts 

after the sequences have been operably linked to positioned to ensure the 
functioning of) an expression control sequence. These expression vectors are 
typically replicable in the host organisms either as episomes or as an integral part 
of the host chromosomal DNA. Commonly, expression vectors will contain 

15 selection markers, e.g.. tetracycline resistance or hygromycin resistance, to permit 
detection and/or selection of those cells transformed with the desired DNA 
sequences (SS£, U.S. Patent 4,704,362, which is incorporated herein by 
reference). 

Polynucleotides encoding a variant APP codon 717 polypeptide may 
2 o include sequences that facilitate transcription (expression sequences) and translation 

of the coding sequences, such that the encoded polypeptide product is produced. 

Construction of such polynucleotides is well known in the art and is described 

further in Maniatis et al. Molecular Cloning: A Laboratory Manufrl , 2nd Ed. 

(1989), Cold Spring Harbor, N.Y. For example, but not for limitation, such 
25 polynucleotides can include a promoter, a transcription termination site 

(polyadenylation site in eukaryotic expression hosts), a ribosome binding site, and, 

optionally, an enhancer for use in eukaryotic expression hosts, and, optionally, 

sequences necessary for replication of a vector. 
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£. soji is one prokaryotic host useful particularly for cloning the 
DNA sequences of the present invention. Other microbial hosts suitable for use 
include bacilli, such as Bacillus syMlllS, and other enterobacteriaceae, such as 
Salmon^, Serratia . and various Pgudomonas species. In these prokaryotic 
5 hosts, one can also make expression vectors, which will typically contain 
expression control sequences compatible with the host cell an origin of 
replication). In addition, any number of a variety of well-known promoters will 
be present, such as the lactose promoter system, a tryptophan (trp) promoter 
system, a beta-lactamase promoter system, or a promoter system from phage 

10 lambda. Hie promoters will typically control expression, optionally with an 
operator sequence, and have ribosome binding site sequences and the like, for 
initiating and completing transcription and translation. 

Other microbes, such as yeast, may also be used for expression. 
Sareharomvces is a preferred host, with suitable vectors having expression control 

15 sequences, such as promoters, including 3-phosphoglycerate kinase or other 
glycolytic enzymes, and an origin of repUcation, termination sequences and the like 
as desired. 

In addition to microorganisms, mammalian tissue cell culture may 
also be used to express and produce the polypeptides of the present invention (SSS, 

20 Winnacker, "From Genes to Clones," VCH Publishers, N.Y., N.Y. (1987), which 
is incorporated herein by reference). Eukaryotic cells are actually preferred, 
because a number of suitable host cell lines capable of secreting intact human 
proteins have been developed in the art, and include the CHO cell lines, various 
COS cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. Expression 

2 5 vectors for these cells can include expression control sequences, such as an origin 
of replication, a promoter, an enhancer (Queen et al. (1986) Immunol, Rev , £2:49, 
which is incorporated herein by reference), and necessary processing information 
sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferred expression control sequences are 
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promoters derived from immunoglobulin genes, SV40, Adenovirus, Bovine 
Papilloma Vims, and the like. The vectors containing the DNA segments of 
interest (e.g., polypeptides encoding a variant APP polypeptide) can be transferred 
into the host cell by well-known methods, which vary depending on the type of 
5 cellular host For example, calcium chloride transfection is commonly utilized for 
prokaryotic cells, whereas calcium phosphate treatment or electroporation may be 
used for other cellular hosts. (See. generally. Maniatis, et al. Mfllggular Cloning; 
A Laboratory Manual . Cold Spring Harbor Press, (1982), which is incorporated 
herein by reference.) 

10 Alternatively, homologous recombination may be used to insert an 

APP mutant sequence into a host genome at a specific site, for example, at a host 
APP locus. In one type of homologous recombination, one or more host 
sequence(s) are replaced; for example, a host APP allele (or portion thereof) is 
replaced with a mutant APP allele (or portion thereof). In addition to such gene 

15 replacement methods, homologous recombination may be used to target a mutant 
APP allele to a specific site other than a host APP locus. Homologous 
recombination may be used to produce transgenic non-human animals and/or cells 
that incorporate mutant APP alleles. 

The method lends itself readily to the formulation of test kits which 

20 can be utilized in diagnosis. Such a kit would comprise a carrier being 
compartmentalized to receive in close confinement one or more containers wherein 
a first container may contain suitably labelled DNA probes. Other containers may 
contain reagents useful in the localization of the labelled probes, such as enzyme 
substrates. Still other containers may contain a restriction enzyme (such as Bcfl), 

25 buffers and the like, together with instructions for use. 

EXPERIMENTAL EXAMPLES 
The following examples are provided for illustration and are not 
intended to limit the invention to the specific example provided. 
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PVAMPLE 1 - refection of a Va1->Tle mutatinn in the fl-amvloid (APP> gene 

The segregation of AD and markers along the long arm of 
chromosome 21 in a single family with autopsy-confirmed Alzheimer's disease (see 
Fig. 1) were examined. DNA samples were available from a total of six affected 
5 and 33 unaffected and at risk individuals. 

The APP gene in an affected family member was analyzed by 
polymerase chain reaction (PCR) direct sequencing using intronic primers 
(Gyllensten, U. in PCR Technology, Ed. Erlich, H.A., Stockton Press, 45-60, 
1989; Yoshikai et al. (1990) Gene 87:257). (Fig. 2). The primers were made 
10 according to the manufacturer's protocol using a Gene Assembler Plus (Pharmacia 
LKB). 

PCR was carried out using the following intronic primers in order 
to amplify exon 17 of the APP gene: 

O 15 [A] 5*-CCTCATCCAAATGTCCCCGTCATT-3' [SEQ ID NO:26] AND 

[B] S'-GCCTAATTCTCrcATAGTCTTAATTCCCAC-Sf [SEQ ID NO 27] 

PCR conditions were 94 °C for lOmin to denature; then 35 cycles of 60°C for 
lmin, 72°C for 3min, 94°C for 1.5min; and a single cycle of 72°C for lOmin. 
20 The reaction was carried out using lOmM tris-HCl pH 8.3, 50mM potassium 
chloride, 0.01% gelatin, 1.5raM magnesium chloride, 200/iM of dNTPs, 50 
pmoles of each PCR primer and 1 unit of Taq polymerase. The total final reaction 

volume was 25/d. 

A second PCR reaction was then performed with a final 

25 concentration of 50 pmol of primer [A] and 0.5 pmol of primer [B]. The PCR 

product was purified on a centricon 100 microconcentrator (Amicon) and used 

directly for sequencing with the SEQUENASE kit (version 2.0, United States 

Biochemical Corp.; the word SEQUENASE is a trade mark) following the 

manufacturer's protocol. 

O 
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Ex n 17 was sequenced first because it encodes part of the 
0-amyloid peptide and is the site of the mutation (at APP693) leading to Hereditary 
Cerebral Haemorrhage with Amyloidosis-Dutch Type (HCHWA-D). 

Sequencing of exon 17 revealed a C to T transition at base pair 
2149, causing a valine to isoleucine change at amino acid 717 (Fig. 2 and Fig. 3). 

This C to T transition creates a Bell restriction site enabling 
detection within the PCR product (Fig. 4). BcR digests were carried out at 50°C 
for 2-4 hours, as recommended by the manufacturer, then electrophoresed in 3% 
agarose. 

Screening by PCR of 100 unrelated, normal individuals and 14 cases 
(9 families) of familial late onset disease failed to demonstrate this substitution. 
Screening of 1 1 (9 families) cases of early onset familial disease revealed the Bell 
restriction site in two affected individuals from an unrelated family. The genetic 
data show that the disease loci are linked to the missense mutation. Also, failure 
to detect this polymorphism in 200 normal chromosomes supports the contention 

that it is a pathogenic mutation. 

The valine to isoleucine substitution occurs within the 
transmembrane domain two residues from the C-terminus of the /3-amyloid peptide. 
Computer analysis predicts that the substitution makes the transmembrane more 
hydrophobic and might thus anchor the protein more firmly within the membrane. 
The position of the substitution, two residues from the C-terminus of the /3-amyloid 
peptide may be of significance to the origin of the deposited peptide. This finding 
links Alzheimer's disease to HCHWA-D, a disease in which amyloid deposition 
is due to a mutation closer to the N-terminus but withir the /3-amyloid peptide 
(Levy el al loc. cit.). 



I 
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p yAVfPT R 2 - Preparation of a c e lt line containing a defective g-amyloid (APP) 
gene 

10ml of fresh blood are collected from each individual suffering 
from familial Alzheimer's disease. Lymphocytes are purified from the blood on a 
5 Percoll gradient and mixed with Epstein-Barr virus (EBV). Hie cells are then 
plated out in medium supplemented with 10% foetal calf serum, antibiotics, 
glutamine and Cyclosporin A to kill the T lymphocytes. B lymphocytes which are 
infected by EBV become immortalized and establish a permanent cell line derived 

from the B cells of the patient. 
10 a lymphoblastoid cell line, AC21, has been deposited with the 

European Collection of Animal Cell Cultures, Porton Down. 



P XAM PT ' F - 2 - D**"*"" nf a V a1 -> filv mutation in the g-amvloid (APP) gene 
f~\ 15 A pedigree, designated F19 and shown in Fig. 5, which has 

autopsy-confirmed AD with an onset age of 59+4 years was identified by 
observing that an allele of the highly polymorphic dinucleotide repeat marker 
GT12 (D21S210), which is located close to the APP gene, co-segregated with the 
disease. Linkage analysis gave a peak lod score between the marker and the 
2 o disease of 3.02 at a recombination fraction of zero, as the following table shows: 
Theta. 0 0.01 0.05 0.1 0.2 0.3 0.4 
Lod 3.02 2.97 2.75 2.47 1.86 1.22 0.6 

Lod scores were calculated with seven liability classes modelling 
age-dependent penetrances from 0.01 to 0.95 with a phenocopy rate of 0.001 and 
25 a gene frequency of 0.001 using MLINK from the LINKAGE package (Lathrop 
et al (1984) Proc. Nad. Acad. Sci. USA 81:3443). 

APP exon 17 sequences in an affected and an unaffected member of 
F19 were determined. In the affected member, there was a G->T transition at 
position 2150, as can be seen from Fig. 6. 
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The amplification of exon 17 was performed as described in 
Example 1 above and Chartier-Harlin et al. (1991) Neurosci. Letts. 129:134, with 
the following modifications: (a) the amplification primer sequences were: 

ATA-ACC-TCA-TCC-AAA-TGT-CCC-C [SEQ ID NO:28] and 
5 GTA-ACC-CAA-GCA-TCA-TGG-AAG-C [SEQ ID NO:29]; and 

(b) the PCR conditions were 94°C/10 minutes then 35 cycles of 60°C/1 minute, 
72°C/1 minute, 94°C/1 minute, followed by 72°C/5 minutes. 

SO pmol of the second primer were used to generate single stranded 
product, which was then purified (Chartier-Harlin et al. loc. tit.). The purified 
10 product was sequenced with the SEQUENASE lrit (2.0) (Trade mark; USB) using 
a primer of sequence: 

AAA-TGA-AAT-TCT-TCT-AAT-TGC-G [SEQ ID NO:30]. 

The presence of the T->C transition creates gel artefacts which 
were resolved by the inclusion of inosine (SEQUENASE kit) in the sequencing 
15 reaction. 

Direct sequencing of exons 7 and 16 from affected individuals from F19 (Chartier- 
Harlin et al. loc. cit.) shows that these were of normal sequence and SSCA (Orita 
et al. loc. cit.) and Orita et al.) failed to identify changes in exons 2, 3, 7, 9, 12, 
20 13 or 15. SSCA of exon 17 detects both APP693 (Levy, et al. loc. cit. and Hardy 
et al. (1991) Lancet 337:1342-1343) and APP717 Val->He under standard 
screening conditions and, when modified APP717 Val->Gly. 

EXAMPLE 4 - Production of Transgenic Animals with Mutant APP Allele 
25 Generation of the constructs : The vector plink was constructed by 

cloning polylinker between the PvuII and EcoRI sites of pBR322 such that the 
Hindm end of the polylinker was adjacent to the PvuII site. The ligation 
destroyed both the EcoRI and PvuII sites associated with the pBR322 segments. 
The 700bp Hpal to EcoRI fragment of pSV2neo (Southern and Berg (1982) L 
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frfnl Appl Genet. 1:327) that contains the SV40 polyadenylation signal was 
cloned into the Hpal to EcoRl sites of plink to generate pNotSV. The 200 bp 
Xhol to PstI fragment of pL2 containing the SV40 16S/lgS splice site (Okayama 
and Berg (1983) Mnl. Cell Biol. 2:280) was isolated, blunted with Menow, then 
cloned into the Hpal site of pNotSV to generate pSplice. The 2.3kb Nrul to Spel 
fragment of pAPP695 containing the coding region of the cDNA for APP (Tanzi 
et al. (1987) Science 235:880) was cloned into the Nrul to Spel site of pSplice to 
generate pd695. The same strategy was used to generate pd751 using the cDNA 
for APP751 (Tanzi et al. (1988) Nature 331:528). A variety of promoters have 
been inserted into the pd695 and pd75 i vectors by using the unique Nrul or the 
Hindm and Nrul sites. 

Generation of pshAPP695 & mhAPP751: The construct 
pAmyproBam was generated by cloning the 1.5kb BamHI fragment of the APP 
cDNA into the BamHI site of pucl9 xHamy. The 700 bp Hindm to Asp718 
fragment of the pAmyproBam (similar to the 700 bp BamHI to Asp718 fragment 
described in Salbaum er al. (1988) EMBO 7:2807) was cloned into the Hindm to 
Asp718 sites of pd695 and pd751 to yield pshAPP695 and pshAPP751. 

pAPP69S.andoAPP751 : The pAPP695 and pAPP751 vectors were 
generated by a three-way ligation of the 3.0 kb EcoPJ to Xhol fragment of 
pAmyProBam, the 1.5kb Xhol to Spel fragment of APP751cDNA, and the Spel 
to EcoPJ site of pd751. 

Generation of pNSE751(+471 : The pNSE751 (+47) was 
constructed using a three-way ligation of the Hindm to Kpnl fragment of pNSE6 
(Forss-Pettereffl/. (1990) Neuron 5:187). The Kpnl to BstYl fragment of pNSE6 
and a partial BamHI (-47nt relative to the ATG) to Hindm fragment of pAPP751. 
This resulted in the generation of a Kpnl fragment that was cloned into the Kpnl 
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sites f pNSE751(+47). The BstYl/Bam fusion results in the loss f both sites. 

Generation of pNSF751 : This vector was generated using a four 
primer two-step PCR protocol (Higuchi et al (1988) Nucl. Acids Res. 16:7351) 
that resulted in a direct fusion of the NSE initiation codon to the APP coding 
region. Oligonucleotides C2, 1072, 1073, and A2 (see Nucleotide Sequences, 
infra.) were used to generate a PCR product. The Kpnl fragment was generated 
by digestion with the restriction enzyme. The Kpnl fragment was used to replace 
a similar fragment in pNSE751(+47). 



10 



Generation of nNSE751-Harriv and pNSE751-Dutch : The Hardy 
(APP642 Val->Ile of APP695) and Dutch (APP618 Gln->Glu of APP695) 
mutations were introduced using a four primer two-PCR protocol. Both sets of 
reactions used the same "outside primers" with the "inside primers" containing the 
15 appropriate mutations. This resulted in the generation of Bgin to Spel fragment 
after digestion, that contained either the Dutch or the Hardy mutation. The Bgin 
to Spel fragment of pNSE75i was replaced by the mutated fragment to generate 
the appropriate vector. The presence of the mutation was conformed by sequence 
analysis of the vectors. 



20 



Generation of t)NSE751-Hardv and nNSE751-Dtftoh : The Hardy VI 
(APP642 V to I), Hardy VG (APP642 V to G), and Dutch (APP618 E to Q) 
mutations were introduced using the four primer two-step PCR protocol (Higuchi 
et al (1988)). The Hardy VI mutant was generated using primers 117/738, 922, 
25 923, and 785; Hardy VG mutant was generated using primers 117/738, 1105, 
1106, and 785; Dutch mutant was generated using primers 117/738, 1010, 1011, 
and 785. In all these mutations the 700 bp Bgin to Spel fragment was isolated by 
digestion of the PCR product with the restriction enzymes, then cloned into the 
same sites of pNSE751. The mutations were confirmed by sequence analysis. 
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r TrP ^nnofnNFH751 : The human NFH gene (Lees et al. (1988) 
EMBO 7(7): 1947) was isolated from a genomic library using a rat NFH cDNA as 
a probe (Lieberburg et al. (1989) Proc. Natl. Adds. Res. USA 86:2463). An SstI 
fragment was subcloned into the pSK vector. A pair of PCR primers was 
5 generated to place a Nrul site at the 3* end of the 150 bp amplified fragment 
immediately upstream of the initiation codon of the NFH gene. The 5* end 
contains a Kpnl site 150nt upstream of the initiation codon. The final construction 
of pNFH751 was generated by a three-way ligation of the 5.5b HindHI to Kpnl 
fragment of pNFH8.8, the Kpnl to Nrul PCR generated fragment, and the Huidm 
10 to Nrul fragment of pd751. The sequence surrounding the PGR generated fusion 
at the initiation codon was confirmed by sequence analysis. The Dutch and Hardy 
variants of pNFH751 were generated by substitution of the 600 bp BglH to Spel 
fragment from a sequence confirmed mutated vector for the same fragment of 
pNFH751. The presence of the mutation was confirmed by hybridization with the 
vD 15 mutated oligomer or by sequence analysis. 

n ation of nThv751 : The pThy751 vector was generated by a 
three-way ligation. The HindHI to BamHI fragment of pThy8.2 which was 
isolated from a human genomic library (Chang et al. (1985) Proc. Natl. Acad. Sci. 
USA 82:3819), the synthetic fragment ThyAPP, and the HindHI to Nrul fragment 
ofpd751. 
ThyAPP: 

CAGACTGAGATCCCAGAACCCTAGGTCrGACTCTAGGGTCTTGG[SEQ 
ID NO:31] 



20 



25 



(WratinnnfnThvClOO : This pThyClOO construct was generated 
by a three-way ligation. The 3.6kb HindHI to BamHI fragment of pThy8.2, the 
synthetic fragment ThyAPP2, and the HindHI to BglH fragment of pd751 or 
pNSE751 Dutch or pNSE751 Hardy were ligated to yield pThyClOO. 
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ThyAPP2: 

C^GACTGAGATCCCAGAACCGATCCTAGGTCTGACTCTAGGGTCTTGG 

[SEQ ID NO:32] 

The region around the initiation codon was confirmed by sequence 

5 analysis. 

preparation of DNA for injection : The transgene for injection was 
isolated from the corresponding vector of interest for digestion with NotI and gel 
electrophoresis. The transgene was purified by using the Gene Clean kit (BiolOl), 
10 then further purified on an Hutip or KPLC purified on a Nucleegen 4000 column. 

Microinjection : The transgene was injected at 2-20 mcg/ml into the 
most convenient pronucleus (usually the male pronucleus) of FVB or B6D2F2 one- 
cell embryos (Manipulating the Mouse Embryo, B. Hogan, F. Constantini, E. 
0^ 15 ' Lacy, rold Spring Harbor . 1986). The injected embryos were cultured overnight. 

Embryos that split to the two-cell stage were implanted into pseudo-pregnant 
female CD1 mice. The mice were weaned at approximately 21 days. Samples of 
DNA obtained from tail biopsy were analyzed by Southern blot using a transgene 
specific probe (usually the SV40 3*s splice and polyadenylation signal sequences). 
20 Transgenic mice harboring at least one copy of the transgene were identified. 

Use of Transgenic Mice: A mouse that expresses the hAPP gene 
or its variants can be used to test the pathogenesis of amyloid deposition and 
therapeutic intervention designed to modulate amyloid deposition. 

Biochemical analysis of the transgenic mice reveals possible 
25 intermediates in the catabolism of APP that are likely precursors to beta-amyloid. 

This analysis can be carried out in the animal or in primary tissue culture of the 
expressing cells. 

The animal can be used to test potential therapeutic agents. The test 
group of mice is treated with the test compound administered in an appropriate 
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fashion for a set period. At the conclusion of the test period, the animals are 
assessed behaviourally, biochemically, and histologically for any possible effects 
of the test compound. The exact protocol depends on the anticipated mechanism 
of action of the test compound. Compounds that may have utility in treating AD 
5 can be identified using this approach. 
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SEQUENCE LISTING 



GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: Imperial College of Science, Technology & 

Medicine (not US) 

(B) STREET: Sherfield Building, Exhibition Road, 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : SW7 2AZ 

(A) NAME: HARDY, John Anthony (US only) 

(B) STREET: 187 Drakefell Road 

(C) CITY: London 
(E) COUNTRY: GB 

■(F) POSTAL CODE (ZIP) : SE4 

(A) NAME: GOATE, Alison Mary (US only) 

(B) STREET: 100 High Street, Hampton Wick, 

(C) CITY: Kingston-on-Thames 

(D) STATE: Surrey 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : KT1 4DQ 

(A) NAME: MULLAN, Michael John (US only) 

(B) STREET: 'Suncoast Gerontology Ctr, 12901 Bruce B. 

Downs Blvd. MDC 50, 

(C) CITY: Tampa 

(D) STATE: Florida 

(E) COUNTRY: US 

(F) POSTAL CODE (ZIP): 33612 

(A) NAME: CHARTI ER-HARLIN , Marie-Christine (US only) 

(B) STREET: 63 Francis Road 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP): E10 6PN 

(A) NAME: OWEN, Michael John (US only) 

(B) STREET: Four Hedges, Castlehill, LLanblethian, 

(C) CITY: Cowbridge 

(D) STATE: South Glamorgan 

(E) COUNTRY: GB 

(ii) TITLE OF INVENTION: Test and Model for Alzheimer's Disease 
(iii) NUMBER OF SEQUENCES: 44 

(iv) COMPUTER READABLE FORM: 
Not Applicable 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB92/ 
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I) INFORMATION FOR SEQ ID NO Sis 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTHS 695 amino acids 

(B) TYPES amino acid 

(C) STRANDEDNESS % single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES protein 

(Xi) SEQUENCE DESCRIPTION s SEQ ID NO sis 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Jhr Ala Arg 
^ 5 ^0 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Fro 
20 25 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 9° 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 * 25 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
* 165 170 1/5 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 

225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 * 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin* Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 

325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 

405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 
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val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arci Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
1 y 580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 . 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
6i0 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 *>*° 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 

645 650 655 



His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 
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>) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 • 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 m 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 HO 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 ~ 135 s 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val Val Glu Val Ala Glu Glu Glu GlU Val Ala Glu Val Glu Glu Glu 
225 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 28 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 

290 295 
Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala *Pro Phe Phe 
305 310 31S 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 

325 

Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
340 345 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu 

355 3 «0 365 

His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg 



370 



375 



380 



Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gin 



385 3 90 

Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He Gin His Phe 



405 



410 



Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn Glu Arg Gin 

420 425 
Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu Asn Asp 



445 



435 44 ° 
Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu Gin Ala Val 



450 4 55 
Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg 
465 * 4 ™ 475 

Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe Glu His Val 



485 



490 



Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser Gin Val Met 



500 



505 



Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser Leu Ser Leu 
515 * 520 
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Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp Glu Val Asp 
530 535 540 

Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val Leu Ala Asn 
545 ^ 550 555 560 

Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala Leu Met Pro 

565 570 575 

Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly 
580 585 590 

Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe Gly Ala Asp 
595 600 605 

Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg 

610 615 620 

Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr 
625 630 635 640 

Asn lie Lys Thr Glu Glu lie Ser Glu Val Lys Met Asp Ala Glu Phe 

645 650 655 

Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe 

660 665 . 670 

Ala Glu Asp Val Gly Ser Asn Lys Gly Ala lie He Gly Leu Met Val 
675 680 685 

Gly Gly Val Val He Ala Thr Val He Val He Thr Leu Val Met Leu 
690 695 700 

Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val Glu Val Asp 
705 " 710 715 720 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 

725 730 735 

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
740 745 750 
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INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 770 amino acids 

(B) TYPES amino acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES protein 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOs3s 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp T*r Ala Arg 
1 5 1° 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 43 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

50 55 
Thr Lys Glu Gly He Leu Glh Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 
He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 



Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 I 35 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 1 

- Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

1 165 170 1 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 



180 



Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 2 
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Val Val Glu val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 

225 230 235 ™> 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 «3 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala- Pro Phe Phe 
305 31° 315 

Tvr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 
* 1 325 330 335 

cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
3 405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 
450 455 460 

Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu 
465 470 475 480 

Gin Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 

485 490 495 

Tvr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
1 500 505 510 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser 

515 " 520 525 
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Gin Val Met Thr His Leu Arg Val lie Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu lie Gin Asp 



545 



550 



555 



Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 

565 570 575 

Leu Ala Asn Met lie Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 

' "~ 590 



580 



585 



Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 



625 



630 



635 



Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 



645 650 655 



Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 



660 



665 



Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 
675 680 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 



705 



710 



715 



Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val 

725 730 735 

Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met 
740 745 750 

Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 



Gin Asn 
770 



1- 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGCTGCCCG GTTTGGCACT GCTCCTGCTG GCCGCCTGGA CGGCTCGGGC GpTGGAGGTA 60 

CCCACTGATG GTAATGCTGG CCTGCTGGCT GAACCCCAGA TTGCCATGTT CTGTGGCAGA 120 

CTGAACATGC ACATGAATGT CCAGAATGGG AAGTGGGATT CAGATCCATC AGGGACCAAA 180 

ACCTGCATTG ATACCAAGGA AGGCATCCTG CAGTATTGCC AAGAAGTCTA CCCTGAACTG 240 

CAGATCACCA ATGTGGTAGA AGCCAACCAA CCAGTGACCA TCCAGAACTG GTG CAAGCGG 300 

GGCCGCAAGC AGTGCAAGAC CCATCCCCAC TTTGTGATTC CCTACCGCTG CTTAGTTGGT 360 

GAGTTTGTAA GTGATGCCCT TCTCGTTCCT GACAAGTGCA AATTCTTACA CCAGGAGAGG 420 

ATGGATGTTT GCGAAACTCA TCTTCACTGG CACACCGTCG CCAAAGAGAC ATGCAGTGAG 480 

AAGAGTACCA ACTTGCATGA CTACGGCATG TTGCTGCCCT GCGGAATTGA CAAGTTCCGA 540 

GGGGTAGAGT TTGTGTGTTG CCCACTGGCT GAAGAAAGTG ACAATGTGGA TTCTGCTGAT 600 

GCGGAGGAGG ATGACTCGGA TGTCTGGTGG GGCGGAGCAG ACACAGACTA TGCAGATGGG 660 

AGTGAAGACA AAGTAGTAGA AGTAGCAGAG GAGGAAGAAG TGGCTGAGGT GGAAGAAGAA 720 

GAAGCCGATG ATGACGAGGA CGATGAGGAT GGTGATGAGG TAGAGGAAGA GGCTGAGGAA 780 

CCCTACGAAG AAGCCACAGA GAGAACCACC AGCATTGCCA CCACCACCAC CACCACCACA 840 

GAGTCTGTGG AAGAGGTGGT TCGAGTTCCT ACAACAGCAG CCAGTACCCC TGATGCCGTT 900 

GACAAGTATC TCGAGACACC TGGGGATGAG AATGAACATG CCCATTTCCA GAAAGCCAAA 960 

GAGAGGCTTG AGGCCAAGCA CCGAGAGAGA ATGTCCCAGG TCATGAGAGA ATGGGAAGAG 1020 

GCAGAACGTC AAGCAAAGAA CTTGCCTAAA GCTGATAAGA AGGCAGTTAT CCAGCATTTC 1080 

CAGGAGAAAG TGGAATCTTT GGAACAGGAA GCAGCCAACG AGAGACAGCA GCTGGTGGAG 1140 

ACACACATGG CCAGAGTGGA AGCCATGCTC AATGACCGCC GCCGCCTGGC CCTGGAGAAC 1200 

TACATCACCG CTCTGCAGGC TGTTCCTCCT CGGCCTCGTC ACGTGTTCAA TATGCTAAAG 1260 
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AAGTATGTCC 


GCGCAGAACA GAAGGACAGA CAGCACACCC 


T A A A G CATTT 


CGAGCATGTG 


1320 


CGCATGGTGG 


ATCCCAAGAA AGCCGCTCAG ATCCGGTCCC 


& fsmr atg A c 


ACACCTCCGT 

AWlv w X www a 


1380 


GTGATTTATG 


AGCGCATGAA TCAGTCTCTC TCCCTGCTCT 




TGCAGTGGCC 


1440 


GAGGAGATTC 


AGGATGAAGT TGATGAGCTG CTTCAwAAALj 


AGCAAAACTA 


TTCAGATGAC 


1500 


GTCTTGGCCA 


ACATGATTAG TGAACCAAGG ATCAGTTACG 


r a A ACGATGC 


TCTCATGCCA 


1560 


TCTTTGACCG 


AAACGAAAAC CACCGTGGAG CTCCTTCCCG 


X vj An X vj vwiun 


GTTCAGCCTG 

^9 X X wX»w w w w 


1620 


GACGATCTCC 


AGCCGTGGCA TTCTTTTGGG GCTGACTCTG 


Iwwwxiw'wWtrl 


r A CAG AAAAC 


1680 


GAAGTTGAGC 


CTGTTGATGC CCGCCCTGCT GCCGACCGAG 


lj X oAwWiu 


TCGACCAGGT 


1740 


TCTGGGTTGA 


CAAATATCAA GACGGAGGAG ATGTCTGAAG 


TGAAGATGGA 


i%» UAbAAl X V- 


1800 


CGACATGACT 


CAGGATATGA AGTTCATCAT CAAAAATTGG 


TGTTCTTTGC 


AGAAGATGTG 


1860 


GGTTCAAACA 


AAGGTGCAAT CATTGGACTC ATGGTGGGCG 


GTGTTGTCAT 


AGCGACAGTG 


1920 


ATCGTCATCA 


CCTTGGTGAT GCTGAAGAAG AAACAGTACA 


CATCCATTCA 


TCATGGTGTG 


1980 


GTGGAGGTTG 


ACGCCGCTGT CACCCCAGAG GAGCGCCACC 


TGTCCAAGAT 


GCAGCAGAAC 


2040 


GGCTACGAAA 


ATCCAACCTA CAAGTTCTTT GAGCAGATGC 


AGAACTAG 




2088 
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(2) INFORMATION FOR SEQ ID NO; 5s 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 2265 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES DNA (genomic) 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOs5s 

ATGCTGCCCG GTTTGGCACT GCTCCTGCTG GCCGCCTGGA CGGCTCGGGC GCTGGAGGTA 60 

CCCACTGATG GTAATGCTGG CCTGCTGGCT GAACCCCAGA TTGCCATGTT CTGTGGCAGA 120 

CTGAACATGC ACATGAATGT CCAGAATGGG AAGTGGGATT CAGATCCATC AGGGACCAAA 180 

ACCTGCATTG ATACCAAGGA AGGCATCCTG CAGTATTGCC AAGAAGTCTA CCCTGAACTG 240 

CAGATCACCA ATGTGGTAGA AGCCAACCAA CCAGTGACCA TCCAGAACTG GTGCAAGCGG 300 

GGCCGCAAGC AGTGCAAGAC CCATCCCCAC TTTGTGATTC CCTACCGCTG CTTAGTTGGT 360 

GAGTTTGTAA GTGATGCCCT TCTCGTTCCT GACAAGTGCA AATTCTTACA CCAGGAGAGG 420 

ATGGATGTTT GCGAAACTCA TCTTCACTGG CACACCGTCG CCAAAGAGAC ATGCAGTGAG 480 

AAGAGTACCA ACTTGCATGA CTACGGCATG TTGCTGCCCT GCGGAATTGA CAAGTTCCGA 540 

GGGGTAGAGT TTGTGTGTTG CCCACTGGCT GAAGAAAGTG ACAATGTGGA TTCTGCTGAT 600 

GCGGAGGAGG ATGACTCGGA TGTCTGGTGG GGCGGAGCAG ACACAGACTA TGCAGATGGG 660 

AGTGAAGACA AAGTAGTAGA AGTAGCAGAG GAGGAAGAAG TGGCTGAGGT GGAAGAAGAA 720 

GAAGCCGATG ATGACGAGGA CGATGAGGAT GGTGATGAGG TAGAGGAAGA GGCTGAGGAA 780 

CCCTACGAAG AAGCCACAGA GAGAACCACC AGCATTGCCA CCACCACCAC CACCACCACA 840 

GAGTCTGTGG AAGAGGTGGT TCGAGAGGTG TGCTCTGAAC AAGCCGAGAC GGGGCCGTGC 900 

CQAGCAATGA TCTCCCGCTG GTACTTTGAT GTGACTGAAG GGAAGTGTGC CCCATTCTTT 960 

TACGGCGGAT GTGGCGGCAA CCGGAACAAC CGGAACAACT TTGACACAGA AGAGTACTGC 1020 

ATGGCCGTGT GTGGCAGCGC CATTCCTACA ACAGCAGCCA GTACCCCTGA TGCCGTTGAC 1080 

AAGTATCTCG AGACACCTGG GGATGAGAAT GAACATGCCC ATTTCCAGAA AGCCAAAGAG 1140 

AGGCTTGAGG CCAAGCACCG AGAGAGAATG TCCCAGGTCA TGAGAGAATG GGAAGAGGCA 1200 

GAACGTCAAG CAAAGAACTT GCCTAAAGCT GATAAGAAGG CAGTTATCCA GCATTTCCAG 1260 
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GAGAAAGTGG AATCTTTGGA ACAGGAAGCA GCCAACGAGA GACAGCAGCT GGTGGAGACA 
CACATGGCCA GAGTGGAAGC CATGCTCAAT GACCGCCGCC GCCTGGCCCT GGAGAACTAC 
ATCACCGCTC TGCAGGCTGT TCCTCCTCGG CCTCGTCACG TGTTCAATAT GCTAAAGAAG 
TATGTCCGCG CAGAACAGAA GGACAGACAG CACACCCTAA AGCATTTCGA GCATGTGCGC 
ATGGTGGATC CCAAGAAAGC CGCTCAGATC CGGTCCCAGG TTATGACACA CCTCCGTGTG 
ATTTATGAGC GCATGAATCA GTCTCTCTCC CTGCTCTACA ACGTGCCTGC AGTGGCCGAG 
GAGATTCAGG ATGAAGTTGA TGAGCTGCTT CAGAAAGAGC AAAACTATTC AGATGACGTC 
TTGGCCAACA TGATTAGTGA ACCAAGGATC AGTTACGGAA ACGATGCTCT CATGCCATCT 
TTGACCGAAA CGAAAACCAC CGTGGAGCTC CTTCCCGTGA ATGGAGAGTT CAGCCTGGAC 
GATCTCCAGC CGTGGCATTC TTTTGGGGCT GACTCTGTGC CAGCCAACAC AGAAAACGAA 
GTTGAGCCTG TTGATGCCCG CCCTGCTGCC GACCGAGGAC TGACCACTCG ACCAGGTTCT 
GGGTTGACAA ATATCAAGAC GGAGGAGATC TCTGAAGTGA AGATGGATGC AGAATTCCGA 
CATGACTCAG GATATGAAGT TCATCATCAA AAATTGGTGT TCTTTGCAGA AGATGTGGGT 
TCAAACAAAG GTGCAATCAT TGGACTCATG GTGGGCGGTG TTGTCATAGC GACAGTGATC 
GTCATCACCT TGGTGATGCT GAAGAAGAAA CAGTACACAT CCATTCATCA TGGTGTGGTG 
GAGGTTGACG CCGCTGTCAC CCCAGAGGAG CGCCACCTGT CCAAGATGCA GCAGAACGGC 
TACGAAAATC CAACCTACAA GTTCTTTGAG CAGATGCAGA ACTAG 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2265 
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INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

lie Ala Thr Val lie Xaa lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

He Ala Thr Val He Gly He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

He Ala Thr Val He Met He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

He Ala Thr Val He Ala He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

He Ala Thr Val He Ser He Thr Leu 
1 5 



PCT/GB92/00123 

WO 92/13069 



56 

INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

He Ala Thr Val He He He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

lie Ala Thr Val lie Leu lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

He Ala Thr Val He Thr He Thr Leu 
1 5 



WO 92/13069 



PCT/GB92/00123 



59 

INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

He Ala Thr Val He Pro He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

He Ala Thr Val He His He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

lie Ala Thr Val lie Cys He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

He Ala Thr Val He Tyr He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

He Ala Thr Val He Phe He Thr Leu 
1 5 



WO 92/13069 



PCT/GB92/00123 



64 

INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

He Ala Thr Val He Glu He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

lie Ala Thr Val lie Trp lie Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

He Ala Thr Val He Arg He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

lie Ala Thr Val lie Asp lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

He Ala Thr Val He Asn He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

He Ala Thr Val He Lys He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

lie Ala Thr Val lie Gin He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCTCATCCAA ATGTCCCCGT CATT 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GCCTAATTCT CTCATAGTCT TAATTCCCAC 



30 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATAACCTCAT CCAAATGTCC CC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTAACCCAAG CATCATGGAA GC 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAATGAAATT CTTCTAATTG CG 



WO 92/13069 



PCT/GB92/00123 



76 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CAGACTGAGA TCCCAGAACC CTAGGTCTGA CTCTAGGGTC TTGG 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CAGACTGAGA TCCCAGAACC GATCCTAGGT CTGACTCTAG GGTCTTGG 48 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CGACCAGGTT GTGGGTTGAC AAATA 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AATCTATTCA TGCACTAGTT TGATACAGC 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ACAGTGATCA TCATCACCTT G 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAAGGTGATG ATGATCACTG T 



o 



o 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
AGCGACAGTG ATCGGCATCA CCTTGGTG 



WO 92/13069 



PCT/GB92/00123 



83 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CACCCAGGTG ATGCCGATCA CTGTCGCT 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ACCCACATCT TGTGCAAAGA ACAC 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTGTTCTTTG CACAAGATGT GGGT 
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(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCAGCCATC ATGCTGCCCG GGTTGGC 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GCCAACCCGG GCAGCATGAT GACTGGGATC TC 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACCTGCCACT ATACTGGAAT A 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TGTGCATGTT CAGTCTGCCA C 
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wpATTsrr .aimed IS: 

1. An isolated polynucleotide comprising a nucleic acid sequence 
encoding a codon 717 mutant of human amyloid precursor protein 770 (APP770), 

5 or an isoform or fragment of APP770 having a mutant amino acid residue at the 
position encoded by codon 717. 

2. An isolated polynucleotide of claim 1, wherein the amino acid 
at the position encoded by codon 717 is an isoleucine, glycine, or phenylalanine. 

3. An isolated polynucleotide of claim 1, wherein the nucleic acid 
sequence is a cDNA. 

4. A composition comprising a polynucleotide probe capable of 
O 15 specifically hybridizing to an amyloid precursor protein 770 (APP770) allele 

exhibiting a mutation at codon 717. 

5. A composition of claim 4, wherein codon 717 of the mutant 
allele encodes an isoleucine, phenylalanine, or glycine. 

20 

6. A composition of claim 4, wherein the probe is labeled. 

7. A composition of claim 4, wherein the probe comprises at 
least about 10 nucleotides spanning amino acid 717 of the APP770 allele. 

25 

8. A transgenic host comprising a nucleic acid segment 
encoding a position 717 mutant of human amyloid precursor protein 770 
(APP770), or an APP770 isoform or fragment of APP770 having the mutation. 
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9. A host of claim 8, which is a primary or immortalized 
eukaryotic cell line. 

10. A host of claim 8, which is a bacterium. 

11. A host of claim 8, wherein the segment is integrated into the 

host genome. 

12. A host of claim 8, which is a non-human-animal having the 
DNA segment incorporated into its germline and which is capable of expressing 
the mutant APP770 protein. 

13. A host of claim 12, wherein the mutant APP770 protein is 
the sole APP770 protein produced by the animal. 

14. A transgenic non-human animal with germ cells or somatic 
cells comprising a heterologous gene encoding a position 717 mutant amyloid 
precursor protein 770 (APP770), which gene upon expression promotes 
neuropathological characteristics of Alzheimer's disease in the animal. 

15. A cultured human primary or immortalized cell, comprising 
a nucleic acid segment encoding a position 717 mutant of human amyloid precursor 
protein 770 (APP770), or an APP770 isoform of fragment of APP770 having the 
mutation. 

16. A method of screening for an agent capable of treating 
Alzheimer's disease, comprising: 

contacting a host of claim 8 with the agent; and monitoring 
expression or processing of proteins encoded by the mutant APP770 gene. 
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17. A diagnostic method for determining an inherited 
predisposition to Alzheimer's disease in a subject, comprising detecting in the 
subject the presence of an allele of amyloid precursor protein (APP), an isoform 
or fragment thereof, wherein said allele has a sequence polymorphism at a position 
corresponding to codon 717 of APP770. 

18. A method of claim 17, wherein said sequence polymorphism 
is a nucleotide substitution, whereby an isoleucine or glycine is substituted at 
codon 717 of APP770. 

19. A method of claim 17, wherein said sequence polymorphism 
is a single nucleotide substitution. 

20. A method according to claim 17, wherein the detecting step 
15 comprises sequencing a genomic DNA segment from chromosome 21 of the 

subject 

21. A method according to claim 17, wherein the detecting step 
comprises (i) mixing a nucleic acid sample from the subject with one or more 

20 polynucleotide probes capable of hybridizing selectively to an APP gene allele in 
a reaction and (u) monitoring the reaction to determine the presence of the gene 
allele in the sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease. 



25 



22. A method according to claim 20, wherein one probe is a 
polynucleotide comprising a sequence of at least about 10 nucleotides spanning 
codon 717 of an APP770. 
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23. A method according to claim 22, wherein the probes are 
oligonucleotides capable of priming polynucleotide synthesis in a polymerase chain 
reaction, wherein a reaction product comprises a sequence of at least 25 contiguous 
nucleotides from exon 17 of the APP gene. 

24. A method according to claim 21, wherein at least one 
oligonucleotide specifically hybridizes to a sequence present in an intron or 
flanking region of an APP770 gene. 

o 

25 . A method according to claim 21 , wherein the monitoring step 
comprises analyzing sequencing gel reaction products from the PCR reaction. 

26. A method according to claim 21 , wherein the monitoring step 
comprises analyzing an autoradiograph of a Bell digest of reaction products from 
the PCR reaction. 

i 

27. A method according to claim 17, wherein the detecting step 
comprises (i) mixing in an immunological assay an APP770 or isoform protein 
sample from the subject with an antibody reagent specific for the allele and 
(ii) monitoring the assay to determine specific binding between the antibody 
reagent and the protein sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease. 

28. A method according to claim 27, wherein the antibody 
reagent is a monoclonal antibody specifically reactive with an antigenic determinant 
specific for an allele. 

29. A method for genetic analysis of a human subject which 
comprises detecting the presence or absence of at least one polymorphism at codon 
717 of an APP770 gene of an amyloid precursor protein (APP) gene in the subject 
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30- A method according to claim 29, wherein the polymorphism 
is detected by digesting genomic DNA from the subject with at least one restriction 
endonuclease and hybridizing resulting fragments with a detecting probe 

31. A composition comprising a polypeptide free from human 

proteins, comprising a core sequence: 

ne-Ala-Thr-Val-ne-X-He-Thr-Leu- [SEQ ID NO:6] 

wherein X is any of the twenty conventional amino acids except valine. 

32. A transgenic nonhuman animal containing a polypeptide of 

claim 31. 

33. A transgenic nonhuman animal of Claim 32, wherein the 
polypeptide is present in the brain. 

34. An isolated polynucleotide, comprising a nucleic acid 
sequence encoding a mutant human APP allele that cosegregates with a genetic 
predisposition to Alzheimer's disease. 

35. An isolated polypeptide of Claim 34, wherein said mutant 
human APP allele comprises a codon 717 mutant 



36. A method of determining a genetic predisposition of a subject 
to Alzheimer's disease, the method comprising detecting in the subject's DNA the 
25 presence of an allele of a gene encoding amyloid precursor protein (APP). 

37 A method as claimed in claim 36, wherein the step of 
detection is carried out on material removed from, and not returned to, the 
subject's body. 
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38. A method as claimed in claim 36 or 37, wherein the allele 
of the gene encodes a substitution mutant of APP. 

39. A method as claimed in claim 38, wherein a single amino 
5 acid is substituted for another. 
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